57 resultados para transcriptome

em Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho"


Relevância:

20.00% 20.00%

Publicador:

Resumo:

A detailed genome mapping analysis of 213,636 expressed sequence tags (EST) derived from nontumor and tumor tissues of the oral cavity, larynx, pharynx, and thyroid was done. Transcripts matching known human genes were identified; potential new splice variants were flagged and subjected to manual curation, pointing to 788 putatively new alternative splicing isoforms, the majority (75%) being insertion events. A subset of 34 new splicing isoforms (5% of 788 events) was selected and 23 (68%) were confirmed by reverse transcription-PCR and DNA sequencing. Putative new genes were revealed, including six transcripts mapped to well-studied chromosomes such as 22, as well as transcripts that mapped to 253 intergenic regions. In addition, 2,251 noncoding intronic RNAs, eventually involved in transcriptional regulation, were found. A set of 250 candidate markers for loss of heterozygosis or gene amplification was selected by identifying transcripts that mapped to genomic regions previously known to be frequently amplified or deleted in head, neck, and thyroid tumors. Three of these markers were evaluated by quantitative reverse transcription-PCR in an independent set of individual samples. Along with detailed clinical data about tumor origin, the information reported here is now publicly available on a dedicated Web site as a resource for further biological investigation. This first in silico reconstruction of the head, neck, and thyroid transcriptomes points to a wealth of new candidate markers that can be used for future studies on the molecular basis of these tumors. Similar analysis is warranted for a number of other tumors for which large EST data sets are available.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Snake venom glands are a rich source of bioactive molecules such as peptides, proteins and enzymes that show important pharmacological activity leading to in local and systemic effects as pain, edema, bleeding and muscle necrosis. Most studies on pharmacologically active peptides and proteins from snake venoms have been concerned with isolation and structure elucidation through methods of classical biochemistry. As an attempt to examine the transcripts expressed in the venom gland of Bothrops jararacussu and to unveil the toxicological and pharmacological potential of its products at the molecular level, we generated 549 expressed sequence tags (ESTs) from a directional cDNA library. Sequences obtained from single-pass sequencing of randomly selected cDNA clones could be identified by similarities searches on existing databases, resulting in 197 sequences with significant similarity to phospholipase A(2) (PLA(2)), of which 83.2% were Lys49-PLA(2) homologs (BOJU-1), 0.1% were basic Asp49-PLA(2)s (BOJU-II) and 0.6% were acidic Asp49-PLA(2)s (BOJU-III). Adjoining this very abundant class of proteins we found 88 transcripts codifying for putative sequences of metalloproteases, which after clustering and assembling resulted in three full-length sequences: BOJUMET-I, BOJUMET-II and BOJUMET-III; as well as 25 transcripts related to C-type lectin like protein including a full-length cDNA of a putative galactose binding C-type lectin and a cluster of eight serine-proteases transcripts including a full-length cDNA of a putative serine protease. Among the full-length sequenced clones we identified a nerve growth factor (Bj-NGF) with 92% identity with a human NGF (NGHUBM) and an acidic phospholipase A2 (BthA-I-PLA(2)) displaying 85-93% identity with other snake venom toxins. Genetic distance among PLA(2)s from Bothrops species were evaluated by phylogenetic analysis. Furthermore, analysis of full-length putative Lys49-PLA(2) through molecular modeling showed conserved structural domains, allowing the characterization of those proteins as group II PLA(2)s. The constructed cDNA library provides molecular clones harboring sequences that can be used to probe directly the genetic material from gland venom of other snake species. Expression of complete cDNAs or their modified derivatives will be useful for elucidation of the structure-function relationships of these toxins and peptides of biotechnological interest. (C) 2004 Elsevier SAS. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Over 40,000 sugarcane (Saccharum officinarum) consensus sequences assembled from 237,954 expressed sequence tags were compared with the protein and DNA sequences from other angiosperms, including the genomes of Arabidopsis and rice (Oryza sativa). Approximately two-thirds of the sugarcane transcriptome have similar sequences in Arabidopsis. These sequences may represent a core set of proteins or protein domains that are conserved among monocots and eudicots and probably encode for essential angiosperm. functions. The remaining sequences represent putative monocot-specific genetic material, one-half of which were found only in sugarcane. These monocot-specific cDNAs represent either novelties or, in many cases, fast-evolving sequences that diverged substantially from their eudicot homologs. The wide comparative genome analysis presented here provides information on the evolutionary changes that underlie the divergence of monocots and eudicots. Our comparative analysis also led to the identification of several not yet annotated putative genes and possible gene loss events in Arabidopsis.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We report the results of a transcript finishing initiative, undertaken for the purpose of identifying and characterizing novel human transcripts, in which RT-PCR was used to bridge gaps between paired EST Clusters, mapped against the genomic sequence. Each pair of EST Clusters selected for experimental validation was designated a transcript finishing unit (TFU). A total of 489 TFUs were selected for validation, and an overall efficiency of 43.1% was achieved. We generated a total of 59,975 bp of transcribed sequences organized into 432 exons, contributing to the definition of the structure of 211 human transcripts. The structure of several transcripts reported here was confirmed during the course of this project, through the generation of their corresponding full-length cDNA sequences. Nevertheless, for 21% of the validated TFUs, a full-length cDNA sequence is not yet available in public databases, and the structure of 69.2% of these TFUs was not correctly predicted by computer programs. The TF strategy provides a significant contribution to the definition of the complete catalog of human genes and transcripts, because it appears to be particularly useful for identification of low abundance transcripts expressed in a restricted Set of tissues as well as for the delineation of gene boundaries and alternatively spliced isoforms.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Paracoccidioides brasiliensis is a fungal human pathogen with a wide distribution in Latin America. It causes paracoccidioidomycosis, the most widespread systemic mycosis in Latin America. Although gene expression in P. brasiliensis had been studied, little is known about the genome sequences expressed by this species during the infection process. To better understand the infection process, 4934 expressed sequence tags (ESTs) derived from a non-normalized cDNA library from P. brasiliensis (isolate Pb01) yeast-phase cells recovered from the livers of infected mice were annotated and clustered to a UniGene (clusters containing sequences that represent a unique gene) set with 1602 members. A large-scale comparative analysis was performed between the UniGene sequences of P. brasiliensis yeast-phase cells recovered from infected mice and a database constructed with sequences of the yeast-phase and mycelium transcriptome (isolate Pb01) (https://dna.biomol.unb.br/Pb/), as well as with all public ESTs available at GenBank, including sequences of the P. brasiliensis yeast-phase transcriptome (isolate Pb18) (http:// www.ncbi.nlm.nih.gov/). The focus was on the overexpressed and novel genes. From the total, 3184 ESTs (64.53%) were also present in the previously described transcriptome of yeast-form and mycelium cells obtained from in vitro cultures (https://dna.biomol.unb.br/Pb/) and of those, 1172 ESTs (23.75% of the described sequences) represented transcripts overexpressed during the infection process. Comparative analysis identified 1750 ESTs (35.47% of the total), comprising 649 UniGene sequences representing novel transcripts of P. brasiliensis, not previously described for this isolate or for other isolates in public databases. KEGG pathway mapping showed that the novel and overexpressed transcripts represented standard metabolic pathways, including glycolysis, amino acid biosynthesis, lipid and sterol metabolism. The unique and divergent representation of transcripts in the cDNA library of yeast cells recovered from infected mice suggests differential gene expression in response to the host milieu.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define approximate to23,500 genes, of which only approximate to1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that <1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Background: Artificial selection has resulted in animal breeds with extreme phenotypes. As an organism is made up of many different tissues and organs, each with its own genetic programme, it is pertinent to ask: How relevant is tissue in terms of total transcriptome variability? Which are the genes most distinctly expressed between tissues? Does breed or sex equally affect the transcriptome across tissues?Results: In order to gain insight on these issues, we conducted microarray expression profiling of 16 different tissues from four animals of two extreme pig breeds, Large White and Iberian, two males and two females. Mixed model analysis and neighbor - joining trees showed that tissues with similar developmental origin clustered closer than those with different embryonic origins. Often a sound biological interpretation was possible for overrepresented gene ontology categories within differentially expressed genes between groups of tissues. For instance, an excess of nervous system or muscle development genes were found among tissues of ectoderm or mesoderm origins, respectively. Tissue accounted for similar to 11 times more variability than sex or breed. Nevertheless, we were able to confidently identify genes with differential expression across tissues between breeds (33 genes) and between sexes (19 genes). The genes primarily affected by sex were overall different than those affected by breed or tissue. Interaction with tissue can be important for differentially expressed genes between breeds but not so much for genes whose expression differ between sexes.Conclusion: Embryonic development leaves an enduring footprint on the transcriptome. The interaction in gene x tissue for differentially expressed genes between breeds suggests that animal breeding has targeted differentially each tissue's transcriptome.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Vampire bats are notorious for being the sole mammals that strictly feed on fresh blood for their survival. While their saliva has been historically associated with anticoagulants, only one antihemostatic (plasminogen activator) has been molecularly and functionally characterized. Here, RNAs from both principal and accessory submaxillary (submandibular) salivary glands of Desmodus rotundus were extracted, and ~. 200. million reads were sequenced by Illumina. The principal gland was enriched with plasminogen activators with fibrinolytic properties, members of lipocalin and secretoglobin families, which bind prohemostatic prostaglandins, and endonucleases, which cleave neutrophil-derived procoagulant NETs. Anticoagulant (tissue factor pathway inhibitor, TFPI), vasodilators (PACAP and C-natriuretic peptide), and metalloproteases (ADAMTS-1) were also abundantly expressed. Members of the TSG-6 (anti-inflammatory), antigen 5/CRISP, and CCL28-like (antimicrobial) protein families were also sequenced. Apyrases (which remove platelet agonist ADP), phosphatases (which degrade procoagulant polyphosphates), and sphingomyelinase were found at lower transcriptional levels. Accessory glands were enriched with antimicrobials (lysozyme, defensin, lactotransferrin) and protease inhibitors (TIL-domain, cystatin, Kazal). Mucins, heme-oxygenase, and IgG chains were present in both glands. Proteome analysis by nano LC-MS/MS confirmed that several transcripts are expressed in the glands. The database presented herein is accessible online at http://exon.niaid.nih.gov/transcriptome/D_rotundus/Supplemental-web.xlsx. These results reveal that bat saliva emerges as a novel source of modulators of vascular biology. Biological significance: Vampire bat saliva emerges as a novel source of antihemostatics which modulate several aspects of vascular biology. © 2013.